135 research outputs found

    Transposon insertion mapping with PIMMS, Pragmatic Insertional Mutation Mapping System

    Get PDF
    The PIMMS (Pragmatic Insertional Mutation Mapping System) pipeline has beendeveloped for simple conditionally essential genome discovery experiments in bacteria.Capable of using raw sequence data files alongside a FASTA sequence of thereference genome and GFF file, PIMMS will generate a tabulated output of each codingsequence with corresponding mapped insertions accompanied with normalized resultsenabling streamlined analysis. This allows for a quick assay of the genome to identifyconditionally essential genes on a standar d desktop computer prioritizing results forfurther investigation

    Identification of DNA methylation biomarkers from Infinium arrays

    Get PDF
    Epigenetic modifications of DNA, such as cytosine methylation are differentially abundant in diseases such as cancer. A goal for clinical research is finding sites that are differentially methylated between groups of samples to act as potential biomarkers for disease outcome. However, clinical samples are often limited in availability, represent a heterogeneous collection of cells or are of uncertain clinical class. Array based methods for identification of methylation provide a cost effective method to survey a proportion of the methylome at single base resolution. The Illumina Infinium array has become a popular and reliable high throughput method in this field and are proving useful in the identification of biomarkers for disease. Here, we compare a commonly used statistical test with a new intuitive and flexible computational approach to quickly detect differentially methylated sites. The method rapidly identifies and ranks candidate lists with greatest inter-group variability whilst controlling for intra-group variability. Intuitive and biologically relevant filters can be imposed to quickly identify sites and genes of interest

    Duplicated Paralogous Genes Subject to Positive Selection in the Genome of Trypanosoma brucei

    Get PDF
    Background Whole genome studies have highlighted duplicated genes as important substrates for adaptive evolution. We have investigated adaptive evolution in this class of genes in the human parasite Trypanosoma brucei, as indicated by the ratio of non-synonymous (amino-acid changing) to synonymous (amino acid retaining) nucleotide substitution rates. Methodology/Principal Findings We have identified duplicated genes that are most rapidly evolving in this important human parasite. This is the first attempt to investigate adaptive evolution in this species at the codon level. We identify 109 genes within 23 clusters of paralogous gene expansions to be subject to positive selection. Conclusions/Significance Genes identified include surface antigens in both the mammalian and insect host life cycle stage suggesting that competitive interaction is not solely with the adaptive immune system of the mammalian host. Also surface transporters related to drug resistance and genes related to developmental progression are detected. We discuss how adaptive evolution of these genes may highlight lineage specific processes essential for parasite survival. We also discuss the implications of adaptive evolution of these targets for parasite biology and control

    Triaging informative cis-regulatory elements for the combinatorial control of temporal gene expression during Plasmodium falciparum intraerythrocytic development

    Get PDF
    Background: Over 2700 genes are subject to stage-specific regulation during the intraerythrocytic development of the human malaria parasite Plasmodium falciparum. Bioinformatic analyses have identified a large number of over-represented motifs in the 5′ flanking regions of these genes that may act as cis-acting factors in the promoter-based control of temporal expression. Triaging these lists to provide candidates most likely to play a role in regulating temporal expression is challenging, but important if we are to effectively design in vitro studies to validate this role. Methods: We report here the application of a repeated search of variations of 5′ flanking sequences from P. falciparum using the Finding Informative Regulatory Elements (FIRE) algorithm. Results: Our approach repeatedly found a short-list of high scoring DNA motifs, for which cognate specific transcription factors were available, that appear to be typically associated with upregulation of mRNA accumulation during the first half of intraerythrocytic development. Conclusions: We propose these cis-trans interactions may provide a combinatorial promoter-based control of gene expression to complement more global mechanisms of gene regulation that can account for temporal control during the second half of intraerythrocytic development

    A consensus approach to vertebrate de novo transcriptome assembly from RNA-seq data: assembly of the duck (Anas platyrhynchos) transcriptome

    Get PDF
    For vertebrate organisms where a reference genome is not available, de novo transcriptome assembly enables a cost effective insight into the identification of tissue specific or differentially expressed genes and variation of the coding part of the genome. However, since there are a number of different tools and parameters that can be used to reconstruct transcripts, it is difficult to determine an optimal method. Here we suggest a pipeline based on (1) assessing the performance of three different assembly tools (2) using both single and multiple k-mer (MK) approaches (3) examining the influence of the number of reads used in the assembly (4) merging assemblies from different tools. We use an example dataset from the vertebrate Anas platyrhynchos domestica (Pekin duck). We find that taking a subset of data enables a robust assembly to be produced by multiple methods without the need for very high memory capacity. The use of reads mapped back to transcripts (RMBT) and CEGMA (Core Eukaryotic Genes Mapping Approach) provides useful metrics to determine the completeness of assembly obtained. For this dataset the use of MK in the assembly generated a more complete assembly as measured by greater number of RMBT and CEGMA score. Merged single k-mer assemblies are generally smaller but consist of longer transcripts, suggesting an assembly consisting of fewer fragmented transcripts. We suggest that the use of a subset of reads during assembly allows the relatively rapid investigation of assembly characteristics and can guide the user to the most appropriate transcriptome for particular downstream use. Transcriptomes generated by the compared assembly methods and the final merged assembly are freely available for download at http://dx.doi.org/10.6084/m9.figshare.1032613

    Diverse Spatial, Temporal, and Sexual Expression of Recently Duplicated Androgen-Binding Protein Genes in \u3ci\u3eMus musculus\u3c/i\u3e

    Get PDF
    Background The genes for salivary androgen-binding protein (ABP) subunits have been evolving rapidly in ancestors of the house mouse Mus musculus, as evidenced both by recent and extensive gene duplication and by high ratios of nonsynonymous to synonymous nucleotide substitution rates. This makes ABP an appropriate model system with which to investigate how recent adaptive evolution of paralogous genes results in functional innovation (neofunctionalization). Results It was our goal to find evidence for the expression of as many of the Abp paralogues in the mouse genome as possible. We observed expression of six Abpa paralogues and five Abpbg paralogues in ten glands and other organs located predominantly in the head and neck (olfactory lobe of the brain, three salivary glands, lacrimal gland, Harderian gland, vomeronasal organ, and major olfactory epithelium). These Abp paralogues differed dramatically in their specific expression in these different glands and in their sexual dimorphism of expression. We also studied the appearance of expression in both late-stage embryos and postnatal animals prior to puberty and found significantly different timing of the onset of expression among the various paralogues. Conclusion The multiple changes in the spatial expression profile of these genes resulting in various combinations of expression in glands and other organs in the head and face of the mouse strongly suggest that neofunctionalization of these genes, driven by adaptive evolution, has occurred following duplication. The extensive diversification in expression of this family of proteins provides two lines of evidence for a pheromonal role for ABP: 1) different patterns of Abpa/Abpbg expression in different glands; and 2) sexual dimorphism in the expression of the paralogues in a subset of those glands. These expression patterns differ dramatically among various glands that are located almost exclusively in the head and neck, where the sensory organs are located. Since mice are nocturnal, it is expected that they will make extensive use of olfactory as opposed to visual cues. The glands expressing Abp paralogues produce secretions (lacrimal and salivary) or detect odors (MOE and VNO) and thus it appears highly likely that ABP proteins play a role in olfactory communication

    Comparison of Clustering Methods for Investigation of Genome-Wide Methylation Array Data

    Get PDF
    The use of genome-wide methylation arrays has proved very informative to investigate both clinical and biological questions in human epigenomics. The use of clustering methods either for exploration of these data or to compare to an a priori grouping, e.g., normal versus disease allows assessment of groupings of data without user bias. However no consensus on the methods to use for clustering of methylation array approaches has been reached. To determine the most appropriate clustering method for analysis of illumina array methylation data, a collection of data sets was simulated and used to compare clustering methods. Both hierarchical clustering and non-hierarchical clustering methods (k-means, k-medoids, and fuzzy clustering algorithms) were compared using a range of distance and linkage methods. As no single method consistently outperformed others across different simulations, we propose a method to capture the best clustering outcome based on an additional measure, the silhouette width. This approach produced a consistently higher cluster accuracy compared to using any one method in isolation

    Transposon insertion mapping with PIMMS, Pragmatic Insertional Mutation Mapping System

    Get PDF
    The PIMMS (Pragmatic Insertional Mutation Mapping System) pipeline has beendeveloped for simple conditionally essential genome discovery experiments in bacteria.Capable of using raw sequence data files alongside a FASTA sequence of thereference genome and GFF file, PIMMS will generate a tabulated output of each codingsequence with corresponding mapped insertions accompanied with normalized resultsenabling streamlined analysis. This allows for a quick assay of the genome to identifyconditionally essential genes on a standar d desktop computer prioritizing results forfurther investigation

    Method for RNA extraction and transcriptomic analysis of single fungal spores

    Get PDF
    Transcriptomic analysis of single cells has been increasingly in demand in recent years, thanks to technological and methodological advances as well as growing recognition of the importance of individuals in biological systems. However, the majority of these studies have been performed in mammalian cells, due to their ease of lysis and high RNA content. No single cell transcriptomic analysis has yet been applied to microbial spores, even though it is known that heterogeneity at the phenotype level exists among individual spores. Transcriptomic analysis of single spores is challenging, in part due to the physically robust nature of the spore wall. This precludes the use of methods commonly used for mammalian cells. Here, we describe a simple method for extraction and amplification of transcripts from single fungal conidia (asexual spores), and its application in single-cell transcriptomics studies. The method can also be used for studies of small numbers of fungal conidia, which may be necessary in the case of limited sample availability, low-abundance transcripts or interest in small subpopulations of conidia.• The method allows detection of transcripts from single conidia of Aspergillus niger• The method allows detection of genomic DNA from single conidia of Aspergillus nige
    corecore